Feeds to Scour
Scoured 9575 posts in 2.54 s
The art of text (rendering) (39c3)
cdn.media.ccc.de·12h
🖋 Typography
I built a production-ready document parser for RAG apps that actually handles complex tables (full tutorial + code)
dev.to·1d·
Discuss: DEV
📋 Document Grammar
GHC 9.12.3 is now available
haskell.org·1d
💧 Liquidhaskell
alexchunt90/joyce: Joyce: A Reader and Editor for Hypertext
github.com·1d·
Discuss: Hacker News
🔗 Hypermedia APIs
TRUNAJOD: A text complexity library for text analysis built on spaCy - TRUNAJOD 0.1.1 documentation
trunajod20.readthedocs.io·12h
Parsing Grammars
WikiLatih Wiktionary with the Goethe-Institut: Strengthening the Digital Presence of Indonesia’s Local Languages
diff.wikimedia.org·5d
📜 Binary Philology
Document Parsing with LLMs: From OCR to Structural Understanding.
alamedadev.com·3d
📋 Document Grammar
Where Did This Sentence Come From? Tracing Provenance in LLM Reasoning Distillation
arxiv.org·2d
🧮 Theorem Proving
blog dds: 2025-12-23 - An initial analysis of the discovered Unix V4 tape
spinellis.gr·3d·
Discuss: Hacker News
🧬 Bitstream Evolution
Building a PDF Ingestion Pipeline with TypeScript, Wasp, and AI OCR
dev.to·3d·
Discuss: DEV
📄 Document Streaming
Show HN: Ragctl – document ingestion CLI for RAG (OCR, chunking, Qdrant)
github.com·4d·
Discuss: Hacker News
📄 Document Streaming
libxml2 Narrowly Avoids Becoming Unmaintained
hackaday.com·4d
XML
Tools for Successful Documentation Projects
lwn.net·5d·
Discuss: Hacker News
📚 Documentation Archaeology
Odysseus: Jailbreaking Commercial Multimodal LLM-integrated Systems via Dual Steganography
arxiv.org·3d
🦠 Parasitic Storage
A case study in PDF forensics: The Epstein PDFs
pdfa.org·5d·
Discuss: Hacker News
📄 PDF Archaeology
internetarchive/wayback-machine-webextension: A web browser extension for Chrome, Firefox, Edge, and Safari 14.
github.com·4d·
Discuss: Hacker News
Web Archives
Diacritic Restoration for Low-Resource Indigenous Languages: Case Study with Bribri and Cook Islands Māori
arxiv.org·4d
Concrete Syntax
google.github.io/typograms/
google.github.io·6d
🖋 Typography
Retrieval-Augmented Generation for Large Language Models: A Survey
paperium.net·6d·
Discuss: DEV
🌀 Brotli Internals